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Amendments to the Claims 



Claim 1 (currently amended). A system for disambiguating speech input using one of 
voice mode interaction, visual mode interaction, or a combination of voice mode and 
visual mode interaction witti an application comprising: 

a speech disambiguation mec bftn^'^m^ 



an options and parameters component for receiving user parameters and application 
parameters for controlliny the speech disambiguation mechani sm, wherein both the 
user and the application can set the parameters to control said mechanism, and 
wherein the parameters include confidence thresholds goveroii ifr unymhipjous 
recognition and close matches: 



a speech recognition component that receives recorded audio, speech or a 
combination of audio and speech input and generates: 

one or more tokens corresponding to the speech input without translation of the 
langua ge in which the audio, speech, or a combination of speech and audio inp ut 
is received : and 

for each of the one or more tokens, a confidence value indicative of the likelihood 
that a given token correctly represents the speech input; 

a selection component that identijBes, according to a selection algorithm, which two 
or more tokens generated without translation of the language in which the speech, 
audio or combination of speech and audio input is received, and presents said tokens 
as alternatives to the tiser ore to bo pr e o e ntod to a uoer go altunmtivog^ whoroin said 
ah e mativoD oro woido of tokcno; 
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one or more disambiguation components that present the alternatives to tihe user in 
one of voice mode^ visual mode, or a combination of voice mode and visual mode, 
and receive an alternative selected by the user from Ifae nluralitv of alternatives 
presented to the user in one of voice mode, visual mode^ or a combination of voice 
mode and visual mode; and 

an output interface that presents the selected alternative to an application as input. 

Claim 2 (cancelled). The system of claim 1, wherein the disambiguation components and 
the application reside on a single computing device. 

Claim 3 (cancelled). The system of claim 1, wherein the disambiguation components and 
the application reside on sepamte computing devices. 

Claim 4 (original). The system of claim 1, wherein the one or more disambiguation 
components perform said interaction by presenting the user with alternatives in a visual 
mode, and by receiving the user's selection in a visual mode. 

Claim 5 (original). The system of claim 4, wherein the disambiguation components 
present the alternatives to the user in a visual form and allow the user to select from 
among the alternatives using a voice input 

Claim 6 (cancelled). The system of claiml , wherein the one or more disambiguation 
components perform said interaction by presenting the user with alternatives in a visual 
mode, and by receiving the user^s selection either in a visual mode, a voice mode, or a 
combination of visual mode and voice mode. 

Claim 7 (original). The system of claim 1, i^erein the selection component filters the 
one or more tokens ^^cording to a set of parameters. 

Claim 8 (original). The system of claim 7, wherein the set of parameters is user specified, 

3 

PA(S4l14^RCVDAT7l21!20084:42:05rap^^ 



07/21/2B08 16:55 8563740246 



AJANKHA 



PAGE 



Claim 9 (cancelled). The system of claiml, wherein the one or more disambiguation 
components disambiguates the alternatives in pimal iterative stages, whereby the first 
stage narrows the alternatives to a number of alternatives that is smaller than that initially 
generated by the selection component, but greater than one, and whereby the one or more 
disambiguation components operative iteiatively to narrow the alternatives in subsequent 
iterative stages. 

Claim 10 (cancelled). The system of claim 9, whereby the number of iterative stages is 
limited to a specified number. 

Claim 1 1 (currentiy amended). A method of processing speech input using one of voice 
mode interaction, visual mode interaction, or a combination of voice mode and visual 
mode interaction with an application comprising: 



a speech disambipu '^'"" map-^aniitm! 

receiving user parameters and application par ameters for controlling the speech 
disambiauati' -'^ mftrhanis m. wherein bo th the user and the application can set the 
pargmeters to control said fn^rhwi^m. and wherein the parameters include 
confidence thtediolds g(w *^;i»fi iifmmbiffiiiotis rer^frnitinn and rJoae matches: 



receiving a speech input from [[a]] die tiser, 
determining whether the speedi input is ambiguous; 

if the speech input is riot ambiguous, communicating a token representative of the 
speech input to an application as input to the application; and 



if die speech input is ambiguous; 
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selecting two or more tokens gmerated w ithout tranjglation of the language 
in which the speech, audio or combination of speech and audio input is 
received, and presenting said tokens as ahematives to bo proaent e d to the 
user QD oltomativo s , wherein ooid oltomotivos oxo wordo or tokong ; 

presenting the alternatives to the user in one of voice rnode^ visual mode, 
or a combination of voice and visual mode, and receiving a selection of an 
alternative #em bx the user from the plurality of alternatives resented to 
the user in one of voice mode, visual mode* or a combination of voice 
mode and visual mode; and, 

commujcuicating the selected alternative to the application as input to the 
application. 

Claim 1 2 (original). The method of claim 1 1, where the interaction comprises the 
concurrent use of said visual mode and said voice mode. 

Claim 1 3 (original); The method of claim 12, wherein the interaction comprises the user 
selecting from among the plural alternatives using a combination of speech and visual- 
based input. 

Claim 1 4 (original). The method of claim 1 1, wherein the interaction comprises the user 
selecting from among the plural alternatives using visual input 
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